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THE MAILING DATE OF THIS COMMUNICATION. 

• Extensions of time may be available under the provisions of 37 CFR 1 .136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 
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• Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133), 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1)S Responsive to connmunication(s) filed on 27 April 2005 . 
2a)S This action is FINAL. 2b)n This action is non-final. 

3) 0 Since this application is in condition for allowance except for fomial matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1 935 CD. 1 1 , 453 O.G. 21 3. 
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5) 0 Claim(s) is/are allowed. 
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Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 
11 )□ The oath or declaration is objected to by the Examiner. Note the attached Office Action orfomi PTO-152. 
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DETAILED ACTION 



1. 



This after is responsive to communications: Amendment filed 4/27/2005. 



2. 



Claims 1-20 are pending and claims 1,14 and 20 are independent claims. 



3. 



This office action is made Final. 



Claim Rejections - 35 USC §103 



4. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this OfHce action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 



5. Claims 1-3, 14-16, 20 are rejected under 35 U.S.C, 103(a) as being unpatentable over 
Meyerzon et al. (herein after Meyerzon) U.S. Patent No. 6,638,314 Bl filed 6/26/1998 in 
view of Lawrence et al. (herein after Lawrence) U.S. Patent No. 6,289,342 Bl filed 
5/20/1998 and in further view of Blumenthal U.S. Patent No. 6,026,409 filed 9/26/1996. 

In regard to independent claim 1, Meyerzon discloses retrieving a web document at an 
address and extracting contents of the web document for rendering an intermediate dynamically 
constructed in-memory web page representation of the web document at a hub processing unit 
which is formatted as if displayed for viewing on an end-user's web browser (Meyerzon Col 7 
Lines 60-65 and Col 8 Lines 15-20 i.e. web crawler program searches remote server computers 
connected to the network for electronic documents and retrieves electronic documents and 
associated data and a browser displays documents to a user); loading secondary documents 
associated with the web document in order to render the secondary documents as part of the in- 
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memory web page representation (Meyerzon Col 8 Lines 26-35 i.e. the client computer transmits 
data to a search engine, the search engine examines its associated index to find documents and 
returns the documents which are secondary documents and lists the documents for the user to 
view), wherein the secondary documents include one or more images with textual content 
embedded therein (Meyerzon Col 9 Lines 44-50 i.e. visual element m include text and hyperlink 
to an image); analyzing and summarizing the in-memory web page representation to produce a 
text map for the web page document of the textual contents (Meyerzon Col 10 Lines 13-16 i.e. 
passes the lists of properties and text to the indexing engine and the indexing engine creates an 
index, which is used by the search engine in subsequent searches). 

Meyerzon does not specifically mention using optical character recognition on the 
images to extract textual content for adding to the textual map for the web page document. 
However, Lawrence mentions extracting data using optical character recognition (Lawrence Col 
7 Lines 51-56 i.e. conversion to electronic form by use of OCR). It would have been obvious to 
one of ordinary skill in the art at the time of the invention, to apply Lawrence to Meyerzon, 
providing Meyerzon the benefit of extracting content from a document using OCR, which is 
quicker the typing out an entire document manually by hand. 

Meyerzon does not specifically mention wherein the hub processing unit renders the in- 
memory webpage prior to analyzing and summarizing the in-memory webpage. However, 
Blumenthal mentions a document that can be rendered prior to user actions (Blumenthal Col 17 
Lines 45-53). It would have been obvious to one of ordinary skill in the art at the time of the 
invention, to apply Blumenthal to Meyerzon, providing Meyerzon the benefit of rendering the 
document prior to the user action to ensure the correct page is being analyzed and summarized. 
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In regard to dependent claim 2, which depends on claim 1, Meyerzon discloses 
wherein the retrieving the web document at an address further comprises retrieving a document 
at an address selected from the group of addresses consisting of a nodal address, a network 
address, a URL and equivalents (Meyerzon Col 21 Lines 1-11 i.e. a request to retrieve a list of 
electronic documents and retrieving a set of document address specifications corresponding the 
electronic documents). 

In regard to dependent claim 3, which depends on claim 1, Meyerzon discloses 
wherein the one or more images with textual content embedded therein include at least one of an 
in-line GIF image and an in-line JPEG image. (Meyerzon Col 9 Lines 37-46 i.e. an image is 
retrieved to display on a web page and it is well known in the art the images displayed on web 
pages can be a gif and jpeg image). 

In regard to dependent claim 7, which depends on claim 1, Meyerzon discloses 
initializing a first list with seed values (Meyerzon Col 17 Lines 25-26 i.e. assigning a current 
crawl number to the current web crawl); checking if there are any URLs to be processed and in 
response that any URL exists to be processed then performing the follow sub-steps of (Meyerzon 
Col 17 Lines 28-29 i.e. determine whether an electronic document has been retrieved); 
determining if a URL is in a second list; and in response that a URL is not in the second list then 
performing the following sub-steps of : inserting the URL into the first list; scheduling the URL 
for crawling; crawling the URL when scheduled to do so; removing the URL from the first list 
after the scheduled crawling; entering the URL into the second list (Meyerzon Col 9 Lines 64 
and Col 10 Lines 1-11 i.e. history map checks each hyperlink URL to determine if it is already 
listed in the history map, if not the URLs are added and are marked as not being crawled and 
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added to the transaction log. The history map includes a number crawled and number modified 
data); and repeating the checking step until there are no more URLs to be processed; where if 
the determining step determines that the URL is in the second list then repeating the checking 
step until there are no more URLs to be processed, (Meyerzon Col 12 Lines 1-17 i.e. retrieves 
and processed a URL until there are none left in the transaction log) 

In regard to dependent claim 8, which depends on claim 7, Meyerzon discloses 
wherein the sub-step of initializing a first list with seed values further includes the list being a 
URL pool. (Meyerzon Col 7 Lines 65:67 i.e. retrieving a processing URLs from the transaction 
log) 

In regard to dependent claim 9, which depends on claim 7, Meyerzon discloses 
wherein the sub-step of determining if a URL is in a second list further includes the second list 
being a visited pool (Meyerzon Figure 4 shows a column indicating the number crawled and 
modified) 

In regard to dependent claim 10, which depends on claim 7, Meyerzon discloses 
wherein the sub-step of crawling further comprises the sub-steps of: issuing an HTTP command 
to a web server named in the URL; receiving contents of an HTML page as a result of the issued 
HTTP command; and passing on the contents of the HTML page to a Page Rendering 
subroutine. (Meyerzon Col 8 Lines 26-35 i.e. the client computer transmits data to a search 
engine, the search engine examines its associated index to find documents and returns the 
documents which are secondary documents and lists the documents for the user to view) 

In regard to dependent claim 11, which depends on claim 10, Meyerzon discloses 
receiving the contents of the HTML page in the Page Rendering subroutine; building an in- 
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memory representation of a layout for the HTML page and if more data is needed to properly 
form the representation, then performing the sub-steps of (Meyerzon Col 7 Lines 60-65 and Col 
8 Lines 15-20 i.e. web crawler program searches remote server computers connected to the 
network for electronic documents and retrieves electronic documents and associated data and a 
browser displays documents to a user); requesting additional web-based information; gathering 
this additional web-based information; inserting any URLs associated with this additional web- 
based information into the second list and a URL cache (Meyerzon Col 9 Lines 37-46 i.e. an 
image is retrieved to display on a web page); building a final amended representation; and 
forwarding the final amended representation to an Extraction subroutine; wherein, if no more 
data is needed to properly form the in-memory representation, then forwarding the in-memory 
representation to the Extraction subroutine, (Meyerzon Col 16 Lines 32-44) 

In regard to dependent claim 12, which depends on claim 11, Meyerzon discloses 
accessing a set of memory structures of the Page Renderer (Meyerzon Col 6 Lines 23-60 i.e. 
accessing local and remote memory devices); copying a text portion of the structures into a text 
map (Meyerzon Col 15 Lines 15-16 i.e. copying all of the history map entries into the transaction 
log as entries); inspecting any in-line GIF and JPEG image references in the memory structures 
(Meyerzon Col 9 Lines 37-46 i.e. an image is retrieved to display on a web page and it is well 
known in the art the images displayed on web pages can be a gif and jpeg image); extracting 
alternate text attributes (Meyerzon Col 5 Lines 7-8 i.e. extracting data from each of the retrieved 
documents); adding the alternate text attributes to a text map (Meyerzon Col 2 Lines 48-51 i.e. 
information from the electronic document retrieved from the web crawl is stored in an index); 
extracting text content from the GIF and JPEG images; adding text content from the images to 
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the text map (Meyerzon Col 9 Lines 37-46 i.e. an image is retrieved to display on a web page and 
it is well known in the art the images displayed on web pages can be a gif and jpeg image Col 5 
Lines 7-8 i.e. extracting data from each of the retrieved documents Col 2 Lines 48-51 i.e. 
information from the electronic document retrieved from the web crawl is stored in an index); 
and forwarding the text map to a Page Summarizer subroutine, (Meyerzon Col 9 Lines 64 and 
Col 10 Lines 1-11 i.e. history map checks each hyperlink URL to determine if it is already listed 
in the history map, if not the URLs are added and are marked as not being crawled and added to 
the transaction log. The history map includes a number crawled and number modified data) 

Meyerzon does not specifically mention invoking an optical character recognition 
engine; analyzing any in-line GIF and JPEG images using the optical character recognition 
engine for text content. However, Lawrence mentions extracting data using optical character 
recognition (Lawrence Col 7 Lines 51-56 i.e. conversion to electronic form by use of OCR). It 
would have been obvious to one of ordinary skill in the art at the time of the invention, to apply 
Lawrence to Meyerzon, providing Meyerzon the benefit of extracting content from a document 
using OCR, which is quicker the typing out an entire document manually by hand. 

In regard to dependent claim 13, which depends on claim 12, Meyerzon discloses 
receiving a text map from the Page Extractor subroutine; processing the text map in an 
application-specific manner (Meyerzon Col 2 Lines 48-51 i.e. information from the electronic 
document retrieved from the web crawl is stored in an index to begin the routine); applying data 
extraction patterns to the text map (Meyerzon Col 5 Lines 7-8 i.e. extracting data from each of 
the retrieved documents); translating resultant data from the applying step; forwarding any 
URLs present in the text map to a manager subroutine; and forwarding any extracted data and 
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metadata to application logic, (Meyerzon Col 9 Lines 64 and Col 10 Lines 1-11 i.e. history map 
checks each hyperlink URL to determine if it is already listed in the history map, if not the URLs 
are added and are marked as not being crawled and added to the transaction log. The history map 
includes a number crawled and number modified data) 

In regard to independent claims 14 and 20, claims 14 and 20 in addition to the 
following reflect similar subject matter claimed in claim 1 and are rejected along the same 
rationale. (Meyerzon Col 20 Lines 13-14 i.e. computer readable medium having computer 
executable instruction and Col 20 Lines 23-24 i.e. a system for retrieving stored information) 

In regard to dependent claim 15, which depends on claim 14, claim 15 in addition to 
the following reflect similar subject matter claimed in claim 2 and are rejected along the same 
rationale. (Meyerzon Col 20 Lines 13-14 i.e. computer readable medium having computer 
executable instruction) 

In regard to dependent claim 16, which depends on claim 14, claim 16 in addition to 
the following reflect similar subject matter claimed in claim 3 and are rejected along the same 
rationale. (Meyerzon Col 20 Lines 13-14 i.e. computer readable medium having computer 
executable instruction) 

6. Claims 4-6 and 17-19 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Meyerzon et aL (herein after Meyerzon) in view of Lawrence et al. (herein after 
Lawrence) as applied to claim 1 and in further view of Hobbs U.S. Patent No. 6,523,022 Bl 
filed 7/7/1999. 
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In regard to dependent claim 4, which depends on claim 1, Meyerzon does not 
specifically mention wherein the loading secondary documents further comprises the loading of 
secondary documents including one or more Java applets with textual content embedded therein. 
However, Hobbs mentions that Java applets are used (Hobbs Col 28 Line 35). It would have 
been obvious to one of ordinary skill in the art at the time of the invention, to apply Hobbs to 
Meyerzon, providing Meyerzon the benefit of using Java Applets for web pages in the process of 
searching the web documents because Java Applets are compatible with many web pages and 
browsers. 

In regard to dependent claim 5, which depends on claim 1, Meyerzon does not 
specifically mention wherein the loading secondary documents further comprises the loading of 
secondary documents including web documents selected from the group of documents consisting 
of in-line frames, frames, and equivalents. However, Hobbs mentions that frames and in-line 
frames are used (Hobbs Col 7 Lines 63 through Col 8 Lines 1-34). It would have been obvious to 
one of ordinary skill in the art at the time of the invention, to apply Hobbs to Meyerzon, 
providing Meyerzon the benefit of using frames and in-line frames for easy viewing for the user. 

In regard to dependent claim 6, which depends on claim 4, Meyerzon does not 
specifically mention wherein the loading secondary documents further comprises the loading of 
secondary documents including one or more Java Script components with textual content 
embedded therein. However, Hobbs mentions that Java applets are used (Hobbs Col 28 Line 35), 
It would have been obvious to one of ordinary skill in the art at the time of the invention, to 
apply Hobbs to Meyerzon, providing Meyerzon the benefit of using Java Scripts for web pages 



Application/Control Number: 09/607,370 Page 10 

Art Unit: 2178 

in the process of searching the web documents because Java Scripts are compatible with many 
web pages and browsers. 

In regard to dependent claim 17, which depends on claim 14, claim 17 in addition to 
the following reflect similar subject matter claimed in claim 4 and are rejected along the same 
rationale. (Meyerzon Col 20 Lines 13-14 i.e. computer readable medium having computer 
executable instruction) 

In regard to dependent claim 18, which depends on claim 14, claim 18 in addition to 
the following reflect similar subject matter claimed in claim 5 and are rejected along the same 
rationale. (Meyerzon Col 20 Lines 13-14 i.e. computer readable medium having computer 
executable instruction) 

In regard to dependent claim 19, which depends on claim 17, claim 19 in addition to 
the following reflect similar subject matter claimed in claim 6 and are rejected along the same 
rationale. (Meyerzon Col 20 Lines 13-14 i.e. computer readable medium having computer 
executable instruction) 

Response to Arguments 

7, Applicant's arguments with respect to claims 1-20 have been considered but are 
moot in view of the new ground(s) of rejection. 

The applicant argues that the webpage is not processed prior to analyzing and 
summarizing (Page 12 Para 1). However newly added prior art Blumenthal mentions a document 
that can be rendered prior to user actions (Blumenthal Col 17 Lines 45-53). It would have been 
obvious to one of ordinary skill in the art at the time of the invention, to apply Blumenthal to 
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Meyerzon, providing Meyerzon the benefit of rendering the document prior to the user action to 
ensure the correct page is being analyzed and summarized. 

Conclusion 

8. THIS ACTION IS MADE FINAL. AppUcant is reminded of the extension of time 
policy as set forth in 37 CFR 1. 136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Londra C. Burge whose telephone number is (571) 272-4122. 
The examiner can normally be reached on 8:30am to 5:00pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Stephen Hong can be reached on (571) 272-4124. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 
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Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Pubhc PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 
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